Improving the Cross-Lingual Generalisation in Visual Question Answering

نویسندگان

چکیده

While several benefits were realized for multilingual vision-language pretrained models, recent benchmarks across various tasks and languages showed poor cross-lingual generalisation when multilingually pre-trained models are applied to non-English data, with a large gap between (supervised) English performance (zero-shot) transfer. In this work, we explore the of these on zero-shot visual question answering (VQA) task, where fine-tuned visual-question data evaluated 7 typologically diverse languages. We improve transfer three strategies: (1) introduce linguistic prior objective augment cross-entropy loss similarity-based guide model during training, (2) learn task-specific subnetwork that improves reduces variance without modification, (3) training examples using synthetic code-mixing promote alignment embeddings source target Our experiments xGQA multimodal transformers UC2 M3P demonstrates consistent effectiveness proposed fine-tuning strategy languages, outperforming existing methods sparse models.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Cross-Lingual Question Answering by Answer Translation

We approach cross-lingual question answering by using a mono-lingual QA system for the source language and by translating resulting answers into the target language. As far as we are aware, this is the first cross-lingual QA system in the history of CLEF that uses this method—all other cross-lingual QA systems known to us use translation of the question or query instead. We demonstrate the feas...

متن کامل

Cross-lingual Question Answering with QED

We present improvements and modifications of the QED open-domain question answering system developed for TREC-2003 to make it cross-lingual for participation in the CrossLinguistic Evaluation Forum (CLEF) Question Answering Track 2004 for the source languages French and German and the target language English. We use rule-based question translation extended with surface pattern-oriented preand p...

متن کامل

Cross-Lingual Question Answering Using Off-the-Shelf Machine Translation

We show how to adapt an existing monolingual open-domain QA system to perform in a cross-lingual environment, using off-the-shelf machine translation software. In our experiments we use French and German as source language, and English as target language. For answering factoid questions, our system performs with an accuracy of 16% (German to English) and 20% (French to English), respectively. T...

متن کامل

Cross-Lingual Romanian to English Question Answering at CLEF 2006

This paper describes the development of a Question Answering (QA) system and its evaluation results in the Romanian-English cross-lingual track organized as part of the CLEF 2006 campaign. The development stages of the cross-lingual Question Answering system are described incrementally throughout the paper, at the same time pinpointing the problems that occurred and the way they were addressed....

متن کامل

Cross Lingual Question Answering using QRISTAL for CLEF 2008

QRISTAL [10], [13] is a question answering system making intensive use of natural language processing both for indexing documents and extracting answers. It ranked first in the EQueR evaluation campaign (Evalda, Technolangue [4]) and in first rank in French for CLEF 2005, 2006 and 2007 [11], [12], [14]. This article describes the improvements of the system since last year. Then, it presents our...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence

سال: 2023

ISSN: ['2159-5399', '2374-3468']

DOI: https://doi.org/10.1609/aaai.v37i11.26574